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Abstract 

Linear bio-molecular motors move unidirectionally along a track by coordinating several dif- 
ferent processes, such as fuel (ATP) capture, hydrolysis, conformational changes, binding and 
unbinding from a track, and center-of-mass diffusion. A better understanding of the interdepen- 
dencies between these processes, which take place over a wide range of different time scales, 
would help elucidate the general operational principles of molecular motors. Artificial molecular 
motors present a unique opportunity for such a study because motor structure and function are a 
priori known. Here we describe use of a Master equation approach, integrated with input from 
Langevin and molecular dynamics modeling, to stochastically model a molecular motor across 
many time scales. We apply this approach to a specific concept for an artificial protein motor, 
the Tumbleweed. 
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1. Introduction 

Naturally occurring protein-based molecular motors are a broad class of macromolecules 
that transduce chemical potential energy into directed transport. Examples include rotary motors 
responsible for ATP synthesis (ATP-synthase) and locomotion (fiagella), as well as linear mo- 
tors, which move unidirectionally along a polymeric track and are responsible for, e.g., muscle 
contraction, cargo transport, and DNA/RNA transcription and replication HJGIII). Enabled by 
revolutionary advances in single molecule detection techniques, such as fluorescence microscopy 
and optical tweezers, the activity and performance of individual molecular motors has been di- 
rectly observed (for an overview of techniques and accomplishments see Q). 

Inspired by biomolecular motors, several efforts to construct artificial molecular motors 
have been pursued for the last decade or so, one driving force being the vision of biomimetic, 
nanoscale machinery [5 1. In addition, artificial molecular motors offer an opportunity to develop 
better understanding of the general operational principles of molecular motors, because of the 
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design freedom offered by synthetic motors, and because motor structure and function typically 
are a priori known, allowing for detailed modeling. 

One approach to artificial molecular motors is the use of synthetic, small molecules, which 
has led to a large variety of designs of motors and motor parts f6j |7] [8] |9) including a surface- 
mounted, photochemically driven, rotary motor where a combination of light-driven cis-trans 
isomerizations and thermal relaxations cause a molecule to make a complete 360° rotation in a 
predefined direction IflOl . This particular design has also been subject of detailed modeling IfTTl 
[121 . exemplifying this advantage of synthetic motors. 

In a competing approach [13], and taking advantage of the highly designable, self-organized 
synthesis offered by oligonucleotides , Bath and Turberfield have constructed a linear DNA-based 
motor that is designed to step unidirectionally along a DNA-track [14J. The motor consists of 
two single-stranded (ss) DNA segments as 'feet,' flexibly joined by a double-stranded (ds) linker, 
that attach to complementary sequences on an ss-track. To achieve directional stepping, an ss- 
DNA fuel is introduced which selectively detaches the rear foot by competitive binding, allowing 
the rear foot to diffuse forward and complete a step. 

In a very recent third approach, a design concept for an artificial, protein-based motor was 
proposed [15|, with the aim to develop artificial motors that are based on the same material as 
biological motors. This design, the Tumbleweed, uses externally controlled, ligand-gated binding 
of repressor proteins to a DNA track to achieve unidirectional motion, and will be described in 
more detail below (Fig. [TJ. 

Artificial and biomolecular motors alike must coordinate many processes to achieve unidirec- 
tional motion. In the case of linear DNA- and protein motors, these processes generally include 
fuel (ligand) capture, molecular conformational changes, binding to and unbinding from a track, 
and center-of-mass diffusion. The understanding of the interplay between these processes, which 
can span a vast range of time scales, is central to the design process of artificial motors, and to 
developing a better understanding of molecular motors in general. 

Several different modeling approaches have been used to model molecular motors, each ad- 
dressing different aspects of this many-time-scales problem. First, fundamental approaches to 
directed motion of small, Brownian molecules were developed based an spatially asymmetric 
ratchets lfT6lfT71 and Brownian Motor models lfT8l[19l , and helped understand the role of sym- 
metry breaking, thermal noise and diffusion in molecular motors 11201 . Such models are typically 
not designed to capture the molecule-specific structure-function relationship as well as details 
about sub-step processes of biological motors. 

Molecular dynamics (MD) and Langevin dynamics models (LD), where molecular trajecto- 
ries are calculated from an equation of motion, provide single-molecule information with poten- 
tially high accuracy, but require large amounts of computational time to simulate motor stepping, 
because all short-time processes must be explicitly calculated during the simulation run. One ex- 
ample of this approach are recent LD models of myosin V ll2Tll22ll23l . where the modeling of 
motor stepping was enabled by coarse-graining of the molecule. Fully atomistic molecular dy- 
namics (MD) simulations are currently computationally limited to simulation times on the order 
of nanoseconds, and thus too demanding for full simulations of motor stepping. 

Finally, stochastic models, such as the Master equation approach described in more detail 
below, sacrifice the single-molecule information of dynamic models by modeling transitions 
between ensembles of motor states, but have the great advantage that they allow the modeling of 
motor processes across many different time scales [24] . 

In developing a complex artificial motor, one would thus like to combine the advantages 
of several of the above approaches, specifically the ability to relate atomistic detail to resulting 
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motor performance in terms of stepping rate and run length. Here we describe the integration of 
Master equation (ME) modeling with results from LD and MD simulations in the design process 
of the protein motor Tumbleweed [15|. In the following, we first describe the Tumbleweed 
concept in more detail, followed by a discussion of the time scales of a number processes that 
must be mutually tuned for stepping to result. Some of these time scales, such as the time 
for diffusional search for the next binding site, depend on motor design, are thus tunable to a 
degree, and can be estimated from LD modeling. Using a ME approach, enabled by our detailed 
knowledge of the possible motor states in this designed motor, we can explore the sensitivity of 
the motor design to chosen times and rates. The model output, such as quantitative relationships 
that must be fulfilled for successful motor performance, can then be used in the experimental 
motor design. 



2. The Tumbleweed Motor 



The Tumbleweed (TW) motor concept was developed as a first step towards artficial motors 
based on proteins [ 15 1. It consists of three DNA-binding repressor proteins (Ra, Rb, Rc) attached 
to the corners of a Y-shaped, designed coiled-coil protein hub. The repressor proteins (the motor 
'feet') have characteristically strong binding affinity to a unique ds-DNA base motif only in the 
presence of a specific ligand in solution (a, b, c). Thus the binding activity of each foot can 
be controlled separately by the external ligand supply. The track is then a designed length of 
ds-DNA with cyclic repeats of the three repressor protein binding motifs (binding sites). Step- 
ping of the TW molecule is by center-of-mass diffusion, and directed transport is achieved by 
coordinated binding and unbinding of the motor feet, externally coordinated by cycling ligands 
in solution. The direction of transport is determined by both the binding site arrangement and the 
temporal order of the external ligand supply, and the speed of the motor is determined by how 
quickly ligands can be exchanged (see Fig. [TJ. 



3. Modeling Tumbleweed: A Problem of Timescales 

Consider TW in the state shown at the top of Fig. [T[ with repressors Ra and Rb bound. To 
initiate a step, ligand a in solution is replaced by ligand c. For the step to take place, first the 
binding protein Ra must lose its ligand a, transitioning from its holo- (with ligand) to its apo- 
state (without ligand), and protein Rc must gain its ligand c. Apoprotein R A must then unbind 
from the track, and the motor must diffuse until holoprotein Rc is near its next binding site and 
binds. Throughout this process, the external ligand supply must not be changed, and holoprotein 
Rb must stay attached to the track, or else the motor will fall off and be lost in the solution. 

Qualitatively, these conditions are summarized by the inequality 

1 1 

^— ; <r dlff <r llg < — , (1) 

^on/oft /C Q ^ 

in which k os /oS are the ligand-repressor (dis)association rates, is the diffusional stepping 
(searching) time of the motor, rn g is the time period during which the ligand concentration is 
kept constant, and is the rate for a repressor holoprotein detachment from the DNA track. 

A central modeling question for the design of TW molecule and experimental setup is: What 
are these different time scales and how will their absolute and relative values affect the perfor- 
mance of Tumbleweed? 
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The value of l/k on / g is not well known for our choice of repressor proteins, but is thought 
to be relatively fast compared to any other time scales in the system, on the order of picosec- 
onds ll25ll . Modeling the system on this level requires a full atomistic molecular dynamics 
approach (MD), where the motion and interactions of atoms in the molecule as well as the sur- 
rounding ligands and buffer solution are explicitly calculated. The second term in Eq. [T]is the 
characteristic time for the TW molecule to rotationally diffuse to it's next binding site. To get 
a rough idea of this term, the time scale for a sphere with the approximate diameter of the TW 
molecule (~ 20 nm) to diffuse one binding length (~10 nm) is around 1 /is. To get a detailed 
value for this term, one can also use fully atomistic MD modeling, which allows one to consider 
steric hindraces between molecular components and to understand effective flexibilities of the 
hub-repressor protein joints. For current computational power, the full atomistic MD of the TW 
is able to provide maximum simulation runs of approximately 10~ 9 s, which is near the time 
scale for diffusion of subsections of the molecule, such as one arm of the central hub, but is 
not long enough to completely explore Tdiff. To fully explore how the stepping time depends 
on parameters such as joint flexibility or molecule size, we can coarse-grain the MD model to 
decrease computational time and increase the total simulation run-length time. Inertial motion 
of the TW molecule becomes completely damped by the surrounding fluid for times greater than 
1CT 9 s, and thus can be neglected from the equations of motion for longer times. The atomistic 
picture of the molecule interacting with fluid molecules can then be replaced with a Langevin 
Equation, in which the fluid-molecule interactions are introduced as stochastic thermal noise and 
where molecular components are approximated by geometric objects whose sizes and viscous 
drag coefficients match the original motor components. This over-damped Langevin Dynam- 
ics (LD) approach can then use as input results from a MD model as described above, such as 
steric constraints and joint flexibilities, to correctly approximate the molecular components. For 
TW, the LD approach is applicable to time scales between //s and ms, which is right around the 
center-of-mass diffusion time of ~ 150 fj.s ifTBI . 

The last two terms of Eq. [T] as well as the motor run lengths of interest, are in the 0. 1-100 s 
range. Neither one of the dynamical models mentioned are able to efficiently explore this time 
scale. And because the dynamic models are tuned to model a specific range of time scales, they 
are not well equipped to explore the sensitivity of TW to the interactions of processes across 
different time scales. To quantitatively determine how sensitive TW is to the parameters in Eq.[T] 
which spans across many orders of magnitude, we must sacrifice single-molecule information 
and instead use a stochastic modeling technique that is able to model processes across many time 
scales: The Master equation. 

4. The Master Equation Approach 

The basis of the Master equation (ME) approach is the identification of motor states and their 
associated transition rates. In the case of the Tumbleweed, motor states are defined as distinct 
combinations of binding between the motor, its ligands, and the DNA track. For example, the 
motor state A*BC represents the situation where foot has its ligand bound but is not attached 
to the DNA, foot Rg has no ligand bound, and foot Rc has both its ligand bound and is attached 
to the DNA track ( * and ~ thus represent ligand and DNA binding, respectively). In total, there 
are 80 distinct states for the TW motor: 2 3 = 8 ligand binding states times 10 DNA-binding 
states, shown enumerated in Table [T]below. The number of possible DNA binding states is larger 
than the number of ligand binding states, as there are three theoretically possible configurations 
if all three feet are bound at the same time. 
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The ME is a differential equation that describes transitions between states. If all kinetics are 
approximated as first-order reactions, the ME is also of first order: 



at 



p(t) = M(t)p(t) 



(2) 



where (for a system with N-states) p(t) is a N-dimensional vector with the numeric value 
of the n'th row representing the probability of finding the TW in the n'th motor state, and M{t) 
is an NxN matrix of transition rates between states. A schematic representation of the allowed 
transitions for the TW motor is shown in Fig. [2] With the allowed transitions between the states 
defined, we can now form the M{t) transition matrix. As shown in the state-naming convention of 
Table[T] it becomes convenient to define separate 8x8 sub-matrices L, describing ligand exchange 
at constant DNA binding /, and D'., which represent a change between DNA binding states i and 
j, respectively. The full M(t) is then: 
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with the two types of sub-matrices defined as: 
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where k'J describes the transition between two DNA binding configurations / and j, while 
keeping the ligand binding configuration I constant, and: 
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where k'_ B ^ AB is the associated rate for the process A*B*C => A*BC for the sustained DNA 
binding configuration i. The term -2 in the L,- matrix is the negative sum over the corresponding 
column of the full matrix, which is included to conserve the population. Each of the 10 L; 
matrices contains 24 independent rates, and each of the 36 D l . matrices is fully defined by 8 
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independent rates, which results in a maximal set of 528 independent parameters to completely 
describe the dynamics of the TW motor. Despite the large number of parameters in the TW 
Master equation, the actual transition matrix M(t) is actually rather sparsely populated and is 
best treated as such to allow for efficient and fast calculation. A strength of the ME approach 
is the inclusion of the wide range of relevant time scales in a single calculation. This strength, 
however, results in a stiff equation, i.e. the equation includes terms that can lead to rapid variation 
and numerical instability with most standard solvers, unless the time steps are chosen extremely 
small. We have chosen a specialized solver for stiff ordinary differential equations (ode5r, GNU 
Octave 1 26 1) to allow us to solve the ME quickly and accurately. 

5. Rates and States 

Many of the compound rates needed to populate the transition matrix Eq. [5] are based on 
the component rates of free repressor proteins, but we must allow for the possibility that they 
are modified by the sterical constraints and effective forces generated by the connecting hub 
and the binding to the DNA recogntion sites. Such modfications can at best be estimated by 
using LD or MD, but they are essentially not precisely known until a functioning prototype of 
the Tumbleweed motor has been constructed. For the moment we reduce the complexity of 
the problem by substituting those component rates known from literature for compound rates. 
In particular, we will consider the case where the ligand interaction L, does not depend on the 
current DNA binding state of the repressor, in essence reducing the set of parameters to the 
quantities introduced in Eq. [T] The most easily experimentally accessible parameters are the 
DNA binding and release times k^°^ dpo and ^° lo / apo j n the presence (holo-) and absence (apo-) 
of ligand, respectively. While detailed information in the literature is sparse, A^° l0 is generally 
found on the order of 5 ■ 10~ 3 s _1 Il27ll28ll29l[30l . and is expected to be experimentally tunable 
by salt concentration. The corresponding is at least 2-3 orders of magnitude greater, with 
preliminary experimental results placing it even higher, on the order of 10 3 s _1 [31 , 32 33]- The 
binding rate depends on the relative concentration of repressor and DNA. In our simulations, 
we replace it by the diffusive search time T&g that we established by a Langevin simulation of the 
TW [ 15 1 and found to be about ~ 150/zs. For the remainder of this paper, we use the following 
values unless specified other wise: = 1CT 2 s _1 , krjg = 10 3 s , Tdiff = 200 jis, and Ti; g = 1 s. 
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Table 1 : Naming convention for the 80 distinct states of the Tumbleweed motor. Roman numerals indicate states with 
identical DNA-binding (10 types), while arabic numbers on the left indicate identical ligand binding (8 types). Alterna- 
tively the states can also be numbered consequtivey from 1 to 80. 
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6. Results 



A typical output from the Master equation model consists of the DNA binding probability 
for each foot as a function of time, as shown in Fig. [3] The ME described here does not yield 
spatial information, such as a trajectory or the speed of the TW, but instead only determines in 
which binding state a molecule would most likely be. Spatial information can be included in 
a ME simulation, but it requires one to define a unique set of states at each spatial coordinate, 
which will drastically increase the complexity of the problem and thus the computational time, 
depending on how long a track one wishes to include. But we can imply stepping information 
from our ME by examining specific state transitions throughout the ligand exchange cycle. For 
example, if the binding probability for is zero during the entire [b,c] ligand plug and then 
returns to one during the [c,a] plug, the motors behavior is consistent with a state transition 
from state ABC to ABC, in agreement with the expected ligand controlled binding sequence 
ABC— > ASC^ ABC^ ABC. 

We are now equipped to return to a question raised by Eq. [T| For Tumbleweed to take suc- 
cessive steps and not fall off the track, what is the required quantitative relationship between the 
difFusional step time (Tdiff), the external ligand exchange time (Tii g ) and the average time for the 
repressor proteins to detach from the track (1/A^° l0 )? 

Tug is determined by the time scale on which the ligand concentration can be changed mi- 
crofluidically, and can be expected to be on the order of 0.1-1 s. Fig. [4] shows the population of 
motors attached to the track after 30 steps as a function of for different values of t^jf with 
rug fixed at 1 s. For Tdiff on the 1- 100 yus time scale, the motor remains fairly successful even as 
l/^ff° becomes less than Tn g , but for Tdiff = 2 ms, we see a 20% reduction in motor attachment 
when l/£iS Ic = Tn g = 1 s. It is initially surprising that the diffusive process can affect stepping 
success even though it is temporally separated from other relevant rates by 3 orders of magnitude. 
A physical interpretation of this finding is as follows: the primary reason for the motor to detach 
from the track is when 1 /k^° becomes less than Ty lg , because the motor is then simply not sticky 
enough. For short Tdiff however, the motor has a good chance to survive anyway, because it is 
attached to the track with two feet most of the time, providing a backup for one-foot detachment 
events. However, the vulnerability for detachment is amplified for increasing Tdiff, because the 
motor is attached with only one foot to the track for a greater fraction of the cycle. 

We can also use the ME approach to investigate experimental design considerations and their 
consequences on the performance of TW. For example, there need to be at least three different 
buffer plugs containing ligands, [a,b] [b,c] [c,a], microfluidically introduced in sequence into 
a microchamber containing the TW and immobilized DNA, to cycle the motor. As there is a 
finite length the plugs must travel before they reach the molecules, there will be some interdiffu- 
sion between plugs, creating pseudo-plugs containing all three ligands. Although, in the current 
molecular design, the TW molecule is sterically hindered from binding all three binding proteins 
at the same time (state ABC is not possible), the addition of triply mixed plugs reduces the tem- 
poral asymmetry of the system and could slow the motor down due to missteps. How sensitive 
is the TW to plug-mixing, and does the experimental design need to be adjusted to compensate? 
The expected experimental concentration profiles with interdiffusion between clean two-ligand 
plugs are shown qualitatively in Fig. [5ja). Although the ME is capable of modeling any ligand 
concentration profile, it is computationally expensive to model a continuous concentration func- 
tion because the transition matrix M(t) has to be recalculated at each program time step. Instead, 
we use the step-function approximated pseudo-plug profile shown in Fig^b), which is charac- 
terized by t i and Tj, the durations of the expected 'clean' plug and the pseudo-plug, respectively, 
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With T] + T2 = 7Hg. 

Fig.[6]shows the probability for state ABC during the [b,c] ligand plug for multiple values of 
Ti/Tiig with T]i g = 1 s. Instead of the 0% binding probability for throughout the entire [b,c] 
plug characteristic for T2 = (Fig. |3J, we now see a non-zero, increasing probability for state 
ABC during the mixed pseudo-plug. Because the ABC probability is 0% during the unmixed 
portion of the [b,c] plug t\, this profile implies two possible behaviors: (1) a fraction of motors 
completes the diffusional step forward to the ABC binding state but then detaches Rq and returns 
to the ABC state during T2; (2) motors release Rp, but do not complete the step during t\ and 
return to state ABC. In either scenario, these motors fall out of phase with the ligand supply, 
resulting in the following binding sequence during the rest of the ligand cycle: ABC (original 
site) — > ABC (misstep) — > ABC (backstep to rearward site) — > ABC (original site). Therefore 
a single misstep causes the motor to temporally stall not just for a single ligand plug but for an 
entire ligand exchange cycle. With rug held constant, the longer t%, the less time the motors have 
to complete a step and the more opportunity they have to return to the previous binding state and 
stall. The maximum percentage of these misstepping motors is determined by the fraction of 
motors in the ABC state at the ligand exchange time (t = 2 s in Fig. [6} and is shown in the inset 
of Fig. © 

The maximum stepping rate of TW is determined by T]i g . What is the effect of ligand mixing 
if we chose to use a shorter Tu g to increase the stepping rate? The inset of Fig. [JJ shows the 
probability for state ABC during the plugs [a,b] and [b,c] as a function of Tzhug with ri; g = 0.1 
s, an order of magnitude less than Fig. [6] Instead of the probability for state ABC going to 0% 
during t\ as we saw in Figs. [3] and [6] for Tu g = 1 s, we now see that for Tz/Tu g > 0.5 a fraction 
of motors remain in state ABC throughout the entire pulse. This effect occurs when T\ becomes 
similar to the characteristic fall-off time of the ABC state (~l/lc^°), so a fraction of motors have 
not yet released R& before the mixed pseudo-plug arrives and thus remain in state ABC. Because 
of the high misstepping percentage for Tz/Tug > 0.5, we also begin to see a reduction in the 
probability for the expected state ABC during the [a,b] ligand plug. The misstepping fraction 
for T]j g =0.1 s is shown in Fig. [JJ Therefore, although decreasing rug has the possibility of 
increasing the speed of TW, the system also becomes more sensitive to high levels of mixing 
that could potentially decrease the motor speed by increasing misstepping events. Detailed ME 
modeling of these interdependencies will be highly useful for motor optimization. 

7. Conclusions 

We have shown how a Master equation approach can be integrated with input from molecular 
dynamics and coarse-grained Brownian dynamics (Langevin) modeling to efficiently explore the 
dependence of the performance of a synthetic molecular motor on variables on a large range of 
time scales. 

For the specific case of the protein-motor concept Tumbleweed, returning to Eq. [T] we 
have seen that the successive stepping success of TW is not remarkably sensitive to Tdiff un- 
less l/k^° and Tjj g are nearly equal. In this case, for an experimentally realistic Tag = 1 s, 
the value of Tdiff can decrease motor attachment by 20% even if it is three orders of magnitude 
less than Ty is . Although the component holoprotein detachment rates (^ lo ) are expected to be 
~ 5 ■ 10~ 3 s _1 (suggesting that l/k^° » Tflg), the compound rates may vary due to steric con- 
straints, and to be safe the design of the TW molecule should be tuned to decrease Tdiff as much 
as possible. Future MD modeling will be used to determine how design choices affect Tdiff, e.g. 
through joint flexibilities and steric hindrances. 
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We have also determined how ligand plug mixing affects the stepping behavior of TW. The 
binding state behavior suggests that plug mixing can lead to missteps and temporary stalling 
events if the motor returns to its previous binding state during the mixed plug time T2 and con- 
sequently falls out of phase with the periodic ligand supply. Reducing Tu g may, in principle, 
increase the speed of TW, but we have found the system also becomes more sensitive to ligand 
mixing, and for high levels of mixing temporary stalling events may actually decrease the speed, 
presenting an optimization challenge. For T]i g = 1 s, a mixing time T2/r;, g = 0.5 gives a mis- 
stepping probability of ~ 2.5% per step. Current experimental designs for the TW motor have 
the possibility of reducing T2/Tu g to ~ 0.05 where the probability of misstepping is negligible, 
thus the ligand mixing should not present a significant problem to the successful function of the 
Tumbleweed motor. 

The authors would like to thank the HFSP Motor Collaboration for inspiration and useful 
discussions, specifically Martin J. Zuckermann for technical advice. This work is supported by 
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Figure 1 : Stepping process of the Tumbleweed. A coordinated, directed stepping of the TW motor is achieved by control 
of the ligand supply (letters a, b, and c), and DNA binding site arrangement. Each repressor protein foot (Ra. ^b. and 
Rc, shown in red, blue and green, respectively) is only able to bind strongly to its corresponding binding site while it has 
its specific ligand bound (indicated by a white star). Ligands are offered in a temporally repeating sequence ("plugs") 
[a,b], [b,c], [c,a\. For the motor to successfully complete a step, the time scales for ligand capture, motor diffusion, and 
DNA binding must obey Eq. ^ , , 




Figure 2: Visual representation of the allowed transitions of the Tumbleweed motor. An allowed transition can either 
change the DNA binding state (indicated by roman numerals, top) or the ligand binding configuration (bottom, a star 
indicates the respective repressor has its ligand bound). A complete list of states is given in Table[T] 
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Figure 3: Attachment of the TW feet (R/^: red circle, R$: blue square, Rq: green diamond) to their specific DNA site 
over two complete ligand cycles. The ligand plugs are indicated over the graph. The values for T^g = 200 /is and the 
detachment rate 1 /k^° = 100 s are chosen such that the motor is expected to complete successive steps. We can infer 
stepping motion by the binding behavior: during the entire [b,c] ligand plug (for t = 1 s - 2 s), we see 100% binding of 
Rg and Sc. while /?a is 0%, but Z?a returns to 100% binding probability during the following [c,a] plug (t = 2 s - 3 s) 
while Rb is 0%, which is consistent with the ligand controlled stepping behavior. 
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Figure 4: Successive stepping, measured by the percentage of motor still attached to the DNA after 30 steps, for four 
values of diffusive search time (rdijf ) as a function of detachment rate . We find that the diffusive search time must 
be several orders of magnitude shorter than the average time for detachment for the motor to remain bound to DNA. For 
all curves the ligand plug length is constant (rij g = 1 s). 
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Figure 5: Interdiffusion between clean, binary ligand plugs of duration (rjig) results in the profile shown in (a), with 
time periods during which all three ligands (a, b, and c) are present in solution at the same time. For our numerical 
computations we use a simplified plug profile with discreet levels (b), characterized by a clean, binary plug of duration 
T\ (< r lig)> and a mixed 'pseudo-plug' of length T2- The ratio T2/rij g defines the degree of the mixing. 
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Figure 6: Attachment probability for state ABC during the [b,c] ligand plug for four values of T2/ru g with i"i; g = 1 s. 
The binding probability profile here implies that a fraction of the motors are either completing the diffusional step to the 
ABC state but then detaching Rq and returning to the previous ABC state, or simply not completing the step. Inset: The 
percentage of motors that misstep when ligands nominally are exchanged (time t = 2 s) as a function of T2/t;&. 




Figure 7: Fraction of misstepping motors after ligand plug exchange [a,b] to [b,c] as a function of T9/T/, g with Tn g = 0.1 
s . Inset: Probability of state ABC for the [a,b] and [b,c] ligand plugs as a function of time for three values of T2 /t&. . As 
the mixing time T9 is increased, the probability of the state ABC during the [b,c\ ligand plug no longer approaches zero 
and increases with higher T2/t/, ? . Misstepping occurs when T2/rug > 0.5, where T2 becomes comparable to l/k JT (0.001 
s) and a fraction of the motors never releases R^ before the mixed pseudo-plug arrives. The high misstepping percentage 
also reduces the expected 100% probability for state ABC during the [a,b] ligand plug. 
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